AITopics | clickstream data

Collaborating Authors

clickstream data

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

A Utility-Mining-Driven Active Learning Approach for Analyzing Clickstream Sequences

Wang, Danny Y. C., Jordanger, Lars Arne, Lin, Jerry Chun-Wei

arXiv.org Artificial IntelligenceOct-9-2024

In rapidly evolving e-commerce industry, the capability of selecting high-quality data for model training is essential. This study introduces the High-Utility Sequential Pattern Mining using SHAP values (HUSPM-SHAP) model, a utility mining-based active learning strategy to tackle this challenge. We found that the parameter settings for positive and negative SHAP values impact the model's mining outcomes, introducing a key consideration into the active learning framework. Through extensive experiments aimed at predicting behaviors that do lead to purchases or not, the designed HUSPM-SHAP model demonstrates its superiority across diverse scenarios. The model's ability to mitigate labeling needs while maintaining high predictive performance is highlighted. Our findings demonstrate the model's capability to refine e-commerce data processing, steering towards more streamlined, cost-effective prediction modeling.

prediction, sequence, shap value, (12 more...)

arXiv.org Artificial Intelligence

2410.07282

Country: Europe > Norway > Western Norway > Vestland > Bergen (0.05)

Genre: Research Report > New Finding (1.00)

Industry: Information Technology > Services > e-Commerce Services (0.59)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

Add feedback

TRACE: Transformer-based user Representations from Attributed Clickstream Event sequences

Black, William, Manlove, Alexander, Pennington, Jack, Marchini, Andrea, Ilhan, Ercument, Markeviciute, Vilda

arXiv.org Artificial IntelligenceSep-2-2024

For users navigating travel e-commerce websites, the process of researching products and making a purchase often results in intricate browsing patterns that span numerous sessions over an extended period of time. The resulting clickstream data chronicle these user journeys and present valuable opportunities to derive insights that can significantly enhance personalized recommendations. We introduce TRACE, a novel transformer-based approach tailored to generate rich user embeddings from live multi-session clickstreams for real-time recommendation applications. Prior works largely focus on single-session product sequences, whereas TRACE leverages site-wide page view sequences spanning multiple user sessions to model long-term engagement. Employing a multi-task learning framework, TRACE captures comprehensive user preferences and intents distilled into low-dimensional representations. We demonstrate TRACE's superior performance over vanilla transformer and LLM-style architectures through extensive experiments on a large-scale travel e-commerce dataset of real user journeys, where the challenges of long page-histories and sparse targets are particularly prevalent. Visualizations of the learned embeddings reveal meaningful clusters corresponding to latent user states and behaviors, highlighting TRACE's potential to enhance recommendation systems by capturing nuanced user interactions and preferences

journey, proceedings, sequence, (13 more...)

arXiv.org Artificial Intelligence

2409.12972

Country:

North America > United States > New York > New York County > New York City (0.04)
Europe > United Kingdom (0.04)
Europe > Italy > Apulia > Bari (0.04)

Genre: Research Report > New Finding (0.94)

Industry: Information Technology > Services > e-Commerce Services (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

ClickTree: A Tree-based Method for Predicting Math Students' Performance Based on Clickstream Data

Rohani, Narjes, Rohani, Behnam, Manataki, Areti

arXiv.org Artificial IntelligenceMar-1-2024

ClickTree: A Tree-based Method for Predicting Math Students' Performance Based on Clickstream Data The prediction of student performance and the analysis of students' learning behavior play an important role in enhancing online courses. By analysing a massive amount of clickstream data that captures student behavior, educators can gain valuable insights into the factors that influence academic outcomes and identify areas of improvement in courses. In this study, we developed ClickTree, a tree-based methodology, to predict student performance in mathematical assignments based on students' clickstream data. We extracted a set of features, including problem-level, assignment-level and student-level features, from the extensive clickstream data and trained a CatBoost tree to predict whether a student successfully answers a problem in an assignment. The developed method achieved an AUC of 0.78844 in the Educational Data Mining Cup 2023 and ranked second in the competition. Furthermore, our results indicate that students encounter more difficulties in the problem types that they must select a subset of answers from a given set as well as problem subjects of Algebra II. Additionally, students who performed well in answering end-unit assignment problems engaged more with in-unit assignments and answered more problems correctly, while those who struggled had higher tutoring request rate. The proposed method can be utilized to improve students' learning experiences, and the above insights can be integrated into mathematical courses to enhance students' learning outcomes. In recent years, massive amounts of log data have been collected from students' interactions with online courses, providing researchers with valuable information to analyze student behavior and its impact on academic performance (Yi et al., 2018; Aljohani et al., 2019). By examining clickstream data, educators can gain deeper insights into students' study habits, navigation patterns, and levels of engagement (Wen and Rosé, 2014; Li et al., 2020; Matcha et al., 2020).

assignment, clickstream data, student, (16 more...)

arXiv.org Artificial Intelligence

2403.14664

Country:

North America > United States > New York (0.04)
Europe > Middle East > Malta > Port Region > Southern Harbour District > Valletta (0.04)
Asia > Thailand > Chiang Mai > Chiang Mai (0.04)

Genre:

Research Report > New Finding (1.00)
Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (1.00)
Education > Educational Setting > Online (1.00)
Education > Curriculum > Subject-Specific Education (1.00)

Technology:

Information Technology > Enterprise Applications > Human Resources > Learning Management (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Analyzing the Capabilities of Nature-inspired Feature Selection Algorithms in Predicting Student Performance

Trask, Thomas

arXiv.org Artificial IntelligenceOct-7-2023

Predicting student performance is key in leveraging effective pre-failure interventions for at-risk students. As educational data grows larger, more effective means of analyzing student data in a timely manner are needed in order to provide useful predictions and interventions. In this paper, an analysis was conducted to determine the relative performance of a suite of nature-inspired algorithms in the feature-selection portion of ensemble algorithms used to predict student performance. A Swarm Intelligence ML engine (SIMLe) was developed to run this suite in tandem with a series of traditional ML classification algorithms to analyze three student datasets: instance-based clickstream data, hybrid single-course performance, and student meta-performance when taking multiple courses simultaneously. These results were then compared to previous predictive algorithms and, for all datasets analyzed, it was found that leveraging an ensemble approach using nature-inspired algorithms for feature selection and traditional ML algorithms for classification significantly increased predictive accuracy while also reducing feature set size by up to 65 percent.

algorithm, dataset, student, (12 more...)

arXiv.org Artificial Intelligence

2308.08574

Country:

Asia > Japan (0.05)
North America > United States > New Mexico > Bernalillo County > Albuquerque (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
Asia > China (0.04)

Genre:

Research Report (0.50)
Instructional Material (0.47)

Industry:

Education > Assessment & Standards > Student Performance (0.82)
Education > Educational Setting > Online (0.69)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.92)

Add feedback

MLOps: How to Operationalise E-Commerce Product Recommendation System

#artificialintelligenceMay-25-2022, 21:23:01 GMT

One of the most common challenges in an e-commerce business to build a well-performing product recommender and categorisation model. A product recommender is used to recommend similar products to users so that total time and money spent on platform per user will be increased. There is also a need to have a model to categorise products correctly since there might be some wrongly categorised products in those platforms especially where most of content is generated by users as in case of classified websites. A product categorisation model is used to catch those products and place them back into their right categories to improve overall user experience on the platform. This article has 2 main parts.

categorisation model, operationalise e-commerce product recommendation system, platform, (7 more...)

#artificialintelligence

Industry: Information Technology > Services > e-Commerce Services (0.71)

Technology:

Information Technology > e-Commerce (0.71)
Information Technology > Artificial Intelligence > Machine Learning (0.43)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.43)

Add feedback

Clustering and Semi-Supervised Classification for Clickstream Data via Mixture Models

Gallaugher, Michael P. B., McNicholas, Paul D.

arXiv.org Machine LearningDec-16-2020

Finite mixture models have been used for unsupervised learning for some time, and their use within the semi-supervised paradigm is becoming more commonplace. Clickstream data is one of the various emerging data types that demands particular attention because there is a notable paucity of statistical learning approaches currently available. A mixture of first-order continuous time Markov models is introduced for unsupervised and semi-supervised learning of clickstream data. This approach assumes continuous time, which distinguishes it from existing mixture model-based approaches; practically, this allows account to be taken of the amount of time each user spends on each webpage. The approach is evaluated, and compared to the discrete time approach, using simulated and real data.

classification, continuous time model, time model, (16 more...)

arXiv.org Machine Learning

1802.04849

Country:

North America > United States > Texas (0.04)
North America > United States > Florida > Palm Beach County > Boca Raton (0.04)
North America > United States > California > Alameda County > Hayward (0.04)
(2 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.47)

Add feedback

Solving One of the Biggest Challenges for AI-Based Search Engines: Relevance

#artificialintelligenceAug-4-2020, 19:35:47 GMT

Let's learn how to implement ClickModels in order to extract Relevance from clickstream data. These steps tend to be what is already necessary for implementing an effective enough search engine system for a given application. Eventually, the requirement to upgrade the system to deliver customized results may arise. Doing so should be simple. One could choose from a set of machine learning ranking algorithms, train some selected models, prepare them for production and observe the results.

information retrieval, machine learning, natural language, (13 more...)

#artificialintelligence

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.73)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.71)

Add feedback

Surveys without Questions: A Reinforcement Learning Approach

Sinha, Atanu R, Jain, Deepali, Sheoran, Nikhil, Khosla, Sopan, Sasidharan, Reshmi

arXiv.org Artificial IntelligenceJun-11-2020

The 'old world' instrument, survey, remains a tool of choice for firms to obtain ratings of satisfaction and experience that customers realize while interacting online with firms. While avenues for survey have evolved from emails and links to pop-ups while browsing, the deficiencies persist. These include - reliance on ratings of very few respondents to infer about all customers' online interactions; failing to capture a customer's interactions over time since the rating is a one-time snapshot; and inability to tie back customers' ratings to specific interactions because ratings provided relate to all interactions. To overcome these deficiencies we extract proxy ratings from clickstream data, typically collected for every customer's online interactions, by developing an approach based on Reinforcement Learning (RL). We introduce a new way to interpret values generated by the value function of RL, as proxy ratings. Our approach does not need any survey data for training. Yet, on validation against actual survey data, proxy ratings yield reasonable performance results. Additionally, we offer a new way to draw insights from values of the value function, which allow associating specific interactions to their proxy ratings. We introduce two new metrics to represent ratings - one, customer-level and the other, aggregate-level for click actions across customers. Both are defined around proportion of all pairwise, successive actions that show increase in proxy ratings. This intuitive customer-level metric enables gauging the dynamics of ratings over time and is a better predictor of purchase than customer ratings from survey. The aggregate-level metric allows pinpointing actions that help or hurt experience. In sum, proxy ratings computed unobtrusively from clickstream, for every action, for each customer, and for every session can offer interpretable and more insightful alternative to surveys.

customer, machine learning, reinforcement learning, (20 more...)

arXiv.org Artificial Intelligence

doi: 10.1609/aaai.v33i01.3301257

2006.06323

Country:

North America > United States > New York > New York County > New York City (0.04)
Asia > India (0.04)

Genre:

Questionnaire & Opinion Survey (0.95)
Research Report (0.82)

Industry: Information Technology > Services (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Predicting Online Item-choice Behavior: A Shape-restricted Regression Perspective

Nishimura, Naoki, Sukegawa, Noriyoshi, Takano, Yuichi, Iwanaga, Jiro

arXiv.org Artificial IntelligenceMay-24-2020

This paper examines the relationship between user pageview (PV) histories and their item-choice behavior on an e-commerce website. We focus on PV sequences, which represent time series of the number of PVs for each user--item pair. We propose a shape-restricted optimization model that accurately estimates item-choice probabilities for all possible PV sequences. This model imposes monotonicity constraints on item-choice probabilities by exploiting partial orders for PV sequences, according to the recency and frequency of a user's previous PVs. To improve the computational efficiency of our optimization model, we devise efficient algorithms for eliminating all redundant constraints according to the transitivity of the partial orders. Experimental results using real-world clickstream data demonstrate that our method achieves higher prediction performance than that of a state-of-the-art optimization model and common machine learning methods.

artificial intelligence, machine learning, optimization problem, (17 more...)

arXiv.org Artificial Intelligence

2004.08519

Country:

Asia > Japan > Honshū > Kantō > Ibaraki Prefecture > Tsukuba (0.04)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Japan > Honshū > Kantō > Kanagawa Prefecture (0.04)

Genre:

Overview (1.00)
Research Report > New Finding (0.93)

Industry: Information Technology > Services > e-Commerce Services (0.88)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.90)

Add feedback

Dropout Prediction over Weeks in MOOCs via Interpretable Multi-Layer Representation Learning

Jeon, Byungsoo, Park, Namyong, Bang, Seojin

arXiv.org Artificial IntelligenceFeb-4-2020

Massive Open Online Courses (MOOCs) have become popular platforms for online learning. While MOOCs enable students to study at their own pace, this flexibility makes it easy for students to drop out of class. In this paper, our goal is to predict if a learner is going to drop out within the next week, given clickstream data for the current week. To this end, we present a multi-layer representation learning solution based on branch and bound (BB) algorithm, which learns from low-level clickstreams in an unsupervised manner, produces interpretable results, and avoids manual feature engineering. In experiments on Coursera data, we show that our model learns a representation that allows a simple model to perform similarly well to more complex, task-specific models, and how the BB algorithm enables interpretable results. In our analysis of the observed limitations, we discuss promising future directions.

clickstream, information, representation, (15 more...)

arXiv.org Artificial Intelligence

2002.01598

Country: North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)

Genre:

Research Report (1.00)
Instructional Material > Online (1.00)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (1.00)
Education > Educational Setting > Online (1.00)

Technology:

Information Technology > Enterprise Applications > Human Resources > Learning Management (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback